Coreferent Mention Detection using Deep Learning
نویسندگان
چکیده
A mention may or may not be coreferred elsewhere in the document. Identifying those mentions that are corefered (called coreferents) is an important step in many NLP tasks, like coreference resolution. To classify a mention as singleton or coreferent using just one sentence is a challenging problem, but previous work suggests that there are cues in a sentence which can be used to predict if a mention it contains is corefered elsewhere in the document. We approach this problem in two different ways. First, we try to a classify a mention candidate extracted by rules-based methods as a coreferent or singleton using various hand-crafted features from literature and a Feed Forward Neural Network (FFNN). Second, we approach this problem as identifying coreferent mention boundaries in input sequences using Recurrent Neural Networks. This is a two step process where we first detect mention heads, and then the mention boundaries. Both these parts are trained and evaluated independently. The second approach removes our dependency on an upstream rule-based mention extractor and hand-crafted features for classification, some of which are expensive to compute. Our hypothesis is that the second approach would be able to learn those hand-crafted features (and more) automatically and perform better at the task. We observed that both of our approaches outperformed the baseline logistic regression model which uses all the hand-crafted features of the first approach, showing that deep neural networks can be used effectively for coreferent mention detection. We also observed that the presence of hand crafted features from literature helps and the first approach (FFNN) outperformed the second (RNN) by approximately 3 F1 points. We also compare our results to published results for the coreferent mention detection task.
منابع مشابه
Supervised Models for Coreference Resolution
Traditional learning-based coreference resolvers operate by training a mentionpair classifier for determining whether two mentions are coreferent or not. Two independent lines of recent research have attempted to improve these mention-pair classifiers, one by learning a mentionranking model to rank preceding mentions for a given anaphor, and the other by training an entity-mention classifier to...
متن کاملThe Same-Head Heuristic for Coreference
We investigate coreference relationships between NPs with the same head noun. It is relatively common in unsupervised work to assume that such pairs are coreferent– but this is not always true, especially if realistic mention detection is used. We describe the distribution of noncoreferent same-head pairs in news text, and present an unsupervised generative model which learns not to link some s...
متن کاملNarrowing the Modeling Gap: A Cluster-Ranking Approach to Coreference Resolution
Traditional learning-based coreference resolvers operate by training the mention-pair model for determining whether two mentions are coreferent or not. Though conceptually simple and easy to understand, the mention-pair model is linguistically rather unappealing and lags far behind the heuristic-based coreference models proposed in the pre-statistical NLP era in terms of sophistication. Two ind...
متن کاملDetecting Overlapping Communities in Social Networks using Deep Learning
In network analysis, a community is typically considered of as a group of nodes with a great density of edges among themselves and a low density of edges relative to other network parts. Detecting a community structure is important in any network analysis task, especially for revealing patterns between specified nodes. There is a variety of approaches presented in the literature for overlapping...
متن کاملImproved Best-First Clustering for Coreference Resolution in Indian Classical Music Forums
Clustering step in the mention-pair paradigm for coreference resolution, forms the chain of coreferent mentions from the mention pairs classified as coreferent. Clustering methods including best-first clustering considers each antecedent candidate individually, while selecting the antecedent for an anaphoric mention. Here we introduce an easyto-implement modification to best-first clustering to...
متن کامل